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Abstract 

We consider integer programs on polytopes in M" with m facets whose normal vectors are 
chosen independently from any spherically symmetric distribution. We show that for m at most 
2^, there exist constants ci < C2 such that with high probability, this random IP is infeasible 
if the largest ball contained in the corresponding polytope has radius less than ci -y/log {2m /n) 
and it is feasible if the radius is at least C2 \/log {2m/ n). Thus, a transition from infeasibility 
to feasibility happens within a constant factor increase in the radius. Moreover, if the polytope 
contains a ball of radius f2(log {2m/n)), then there is a randomized polynomial-time algorithm 
to find an integer solution with high probability (over the input). Our main tools are: a new 
connection between integer programming and matrix discrepancy, a bound on the discrepancy 
of random Gaussian matrices and Bansal's algorithm for finding low-discrepancy solutions. 
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1 Introduction 

Integer Linear Programming (IP) is a general and powerful formulation [19] widely used to address 
combinatorial problems. One standard variant is the integer feasibility problem: given a polytope P 
specified by linear constraints Ax < b, find an integer solution in P or report that none exists. The 
problem is NP-hard and appears in Karp's original list [12]. Dantzig [6] suggested the possibility 
of IP being a complete problem even before the Cook-Levin theory of NP-completeness. 

Even though integer programs are NP-hard, they are solved routinely in practice. Most LP 
solvers in the market have an IP solver that employ heuristics based on cutting plane and branch 
and bound techniques to solve the IP. In spite of a vast and rich literature on the theory of cutting 
planes, there are few complexity guarantees apart from an exponential bound due to Gomory for 
binary integer programs [9j. The best-known rigorous bound on the complexity of general IP is 
n^W) pT]. 

Some special cases of IP can be solved efficiently, e.g., IP instances where the constraint ma- 
trix A specifying the polytope is totally unimodular; more generally, when the integer hull of the 
polytope has an efficient separation oracle (using the ellipsoid algorithm p2]). Motivated by the 
question, "which IP's can be solved in polynomial time?", we consider probabilistic integer pro- 
grams, as defined next. 

Model. A random integer program P = P{n,m,xo,R) is generated as follows: we pick a random 
m X n matrix A with i.i.d. rows from a spherically symmetric distribution; and a vector b such 
that the hyperplane for each constraint is at distance at least R from xq, i.e., 6, > -|- AiXQ, 

where Ai is the i'th row of A. 

The condition above implies that P contains a ball of radius R centered at xq. We study the 
feasibility of P as a function of the radius R. Our existence bound is independent of the center xq 
and our algorithm for finding a feasible solution is only given A, b. 

1.1 Results 

We are able to prove the following bounds on the feasibility of random integer programs. 
Theorem 1. Let m > lOOOn and 

Ro = J-log— , = 128 f A + ^"'"^ ^'"^^ . 

V6n \\n\ n I 

Then, with probability at least 1 — 2m-e~"/^^, 

1. for every xq £ R", the random polytope P{n,m, xq, Ri) contains an integer point, and 

2. for Xq = (1/2, . . . , 1/2), the random polytope P{n,m, xq, Rq) does not contain an integer 
point. 

We note that for m = 2^^^\ the second term in Ri is of the same order as the first and so the 
two thresholds are within a constant factor of each other. When m = 0{n), the transition between 
infeasibility and feasibility happens between two absolute constants. 

The upper bound in Theorem [1] is based on showing that given m vectors Ai, . . . , A^ G R" 
such that each Ai is a random unit vector, and a point xq G M", there exists a point x G 
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satisfying |74j(x — xq)] < Ri for every i £ [m]. To prove this, we consider the discrepancy of a 
random Gaussian matrix. 

Theorem 2. Let A G l^™x" a random matrix with i.i.d. entries from N{0, o"^). For any xq S M", 
with high probability, there exists a point x G Z" obtained by rounding each coordinate of xq either 
up or down such that, for every i G [m], 

\Ai{x - xo)\ < ]^ay/nRi. 

The upper bound in Theorem [T] follows from Theorem O by choosing cj^ = 1/n and observing 
that m random unit vectors are obtained by scaling each row by at most a constant with probability 
at least 1 — 2me~'^/'^^ . In terms of classical discrepancy theory, Theorem [2] is equivalent to a bound 
of {aRl^/n/2) on the linear discrepancy of a random Gaussian matrix. 



Algorithm to find an integer point. We complement our existence theorem with an algorithm 
to find an integer point. With R = VL{\J\ogm) and xq = (1/2, ...1/2), there is a trivial algorithm — 
pick a random 0/1 vector. Most such vectors will be feasible in P{n,m, xq, R). But with smaller 
R, and arbitrary centers xq, only an exponentially small fraction of nearby integer vectors might 
be feasible, so such direct sampling/enumeration would not give a feasible integer point. 

Theorem 3. Given a random IP P = P{n,m,xo,R) where 

R>2^^( log ^ + Aog^l5Siog ^] 
\ n V n \ogm I 

there is a randomized polynomial-time algorithm to find an integer point x G P with probability at 
least 1 - 2me-"/96. 

Our algorithm builds on Bansal's semidefinite programming based method for finding low- 
discrepancy solutions [1]. His algorithm finds a vector X £ {—1, +1}" with discrepancy 0{^/rllog {2m/n)) 
when A is any 0/1 matrix. We adapt his algorithm to find a vector x G {0,1}'^ such that 
\\A{x — xo)||oo is 0(cr-^/n log (2m/n)) when each entry in A is from A^(0,(T^). We note that the 
resulting algorithm can be viewed as a general rounding method for integer programming; we are 
able to analyze it only for random IPs. 



1.2 The connection to discrepancy 

The main conceptual contribution of our paper is a connection between integer feasibility and 
discrepancy theory [151 1201 E] • 

Suppose we seek —1/1 points in the polytope (as opposed to integer points). Given a matrix 
A G M™^", consider the polytope P{A,r) = {x G : |j4jx| < r V « G [m]} for a fixed positive r. 
The discrepancy of a matrix A is defined to be the least r so that the polytope P{A, r) contains a 
—1/1 point. More formally, 

disc(A) := min ll^xll 

xe{-i,+i}" °° 

Thus, if we can evaluate the discrepancy of the constraint matrix A describing a polytope, then we 
can obtain a characterization of — 1/ 1-feasibility of the polytope. 
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The following related notion of linear discrepancy helps in characterizing integer feasibility of 
the polytope: 

lindiscfA) := max min ll^fx — xn)!!^ ■ 

xoe[0,l]"a;e{0,l}" °° 

We observe that every polytope P = {x £ \Ai{x — xq)\ < hi for i G [m]} where hi > lindisc(A) 
contains an integer point for every Xq E M". This is because, by linear transformation, we may 
assume that xq is in the fundamental cube defined by the standard basis unit vectors. Thus, if each 
row of the constraint matrix is a unit vector, then linear discrepancy of the constraint matrix gives 
one possible radius of the largest inscribed ball so that the polytope is integer feasible for every 
center xq. 

This approach to verify integer feasibility of a polytope fails for arbitrary polytopes since it is 
NP-hard to find the discrepancy of a set-system to within a factor of y/n [5]. We show that this 
approach can still be used for random polytopes due to tight bounds on the discrepancy and the 
linear discrepancy of random matrices. 

The central quantity that leads to all known bounds on discrepancy and linear discrepancy in 
the literature is hereditary discrepancy defined as follows: 

herdisc(^) := maxdisc(^'^) 

5C[n] 

where denotes the submatrix of A containing columns indexed by the set S. The best known 
bound on discrepancy and hereditary discrepancy of arbitrary matrices is due to Spencer |20j . 



Theorem 4. f^Oj/ For any matrix A G ]^™x" qj^c? any subset S C [n], there exists a point z G 
such that 



\Ai^ z\ < 11\ I \S\log max \Aij\. 



2m 

\S\ iG[m],je5 



for every i G [m] . 



Lovasz, Spencer and Vesztergombi |14j showed the following relation between hereditary dis- 
crepancy and linear discrepancy. 

Theorem 5. 114^ For any matrix A, lindisc{A) < herdisc{A) . 

Hence, every polytope P = {x £ M"| \ Ai{x — xo)\ < bi for i £ [m]} where 



bi = Q{ max l^jjl \/ ^ log(2m/n)) 

iG[m],je[n] 

contains an integer point for every Xq £ M". 

In our setting, each entry Aij is from A^(0,cr^). Using standard concentration for \Aij\ and a 
union bound to bound the maximum entry \Aij\ leads to the following weak bound whp: every 
polytope P = {x £ M"| \Ai(x — xo)\ < bi for i £ [m]} with bi = Q,{ay^nlog mn log (2m/n)) contains 
an integer point for any xq £ M". We strengthen this result using a nonstandard normalization in 
the proof of Spencer's result. 

Our infeasibility threshold is also based on discrepancy. We begin with a lower bound on linear 
discrepancy of random matrices, which excludes any 0/1 vector from being a solution for the chosen 
radius of the inscribed ball, then extend this to exclude all integer points. 
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1.3 Related work in IP 

Probabilistic instances of several combinatorial problems have been studied and shown to have effi- 
cient algorithms, e.g., random knapsack [2], feedback vertex set, largest clique, chromatic number, 
etc in random graphs [4J. These algorithms are combinatorial in nature. To our knowledge, the 
first work on probabilistic instances of IP was by Furst and Kannan [8j . They consider a probabilis- 
tic instance of the subset-sum problem. The subset-sum problem is to find x € {0, l}" satisfying 
Ax = b where A £ N", 6 € Z or output a short certificate of infeasibility. In the probabilistic in- 
stance that they consider, the coefficients of A are drawn uniformly at random from the discrete set 
{1, 2, . . . , M} for some large M. For M > 2" /2+2nj^3n/2^ they show that there exists a polynomial 
time deterministic algorithm to solve the subset-sum problem with high probability for any b. 

Pataki et al. [T7] generalize this further to address probabilistic integer programming problems 
with m constraints where each entry in the constraint matrix is drawn i.i.d. from {1, . . . , M}. They 
consider the integer feasibility problem of the polytope defined by the following linear constraints: 



where A G {1, 2, . . . , M}""''", li,wi G M™, l2,W2 G M". They show that if each entry in the 
constraint matrix A is drawn uniformly at random from the discrete set {1,2,..., M} where M > 
^2{"+4)/2 ■u;2) — (/i ; || )"^^'"~'"^'' ; then there exists a polynomial time deterministic algorithm 
that solves the integer feasibility problem with high probability. 

In other related work, Beier and Vocking [3] and Roglin and Vocking [18j give a smoothed 
analysis for some special cases of IP. 

2 Preliminaries 

We will use the following standard tail bounds. 

Lemma 1. Let Y be a random variable distributed according to N{0,a^). Then for any t > 0, 



Lemma 2. If X is drawn from the Gaussian distribution A^(0,c7^), then for any any A > 1 

Pr(|X| > Xa) < 2e-^. 

Lemma 3. JT]/ Let = Xq = Xi, • • • , Xn be a martingale with increments Yi = — Suppose 
for 1 < i < n, we have that yj|(Xj_i,--- ,Xo) is distributed as Gaussian N{0,rif) where r]i is a 
constant such that \r]i\ < 1. Then 



h<AX< wi 
h<X <W2 




Pr 

Lemma 4. // random variables Xi, 
iV(0,cr2), then for any A > 



n 



> XV^) < 2e-^'/2. 



• ,Xr are drawn i.i.d. from the normal distribution 
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Lemma 5. For any subset S C [n] and for any fixed set of vectors Oi, i ^ [m], if each coordinate 
Xj, j G [n] is drawn uniformly at random from the set { — 1,+1}, then 



Pr I \ ^aijXj\ > A I < 2e 

For a matrix A G ^^^n g ^ j^^j^ -^qI ^-s denote the vector Ai restricted to the 

coordinates in S. We say that the discrepancy of a vector Ai due to x is at most tj if \Aix\ < ti. 
We say that x incurs a discrepancy of at most t if maxjgj^j \Aix\ <t. 

3 Linear discrepancy of a random matrix 

By Theorem [5l it is sufficient to bound the hereditary discrepancy of random matrix to obtain 
an upper bound on the Unear discrepancy of random matrix. We show the following bound on 
hereditary discrepancy of random matrix. 

Theorem 6. Suppose we have m vectors Ai,--- ,Am G K", such that Aij is drawn from the 
distribution N{0,a^) for each i G [m\,j G [n]. Then, for any S C [n], with high probability, there 
exists a point x G { — 1, l}'"^' such that, 



\ n\og h •( / log m log mn log :; for every i G [ml. 

V n V logm / 

Remark: The bound on the discrepancy of submatrix ^4^ given in Theorem [6] is independent of the 
size of S. This is unlike Spencer's result (Theorem [H where the discrepancy of A^ is bounded by 
a function of 

Our overall strategy is similar to that of Spencer (and subsequent work): We first show that 
there exists a point z G {0, — 1, with at least \S\/2 non-zero coordinates such that \Af z\ is 
small. We start with x = 0, S = [n] and use z to fix at least half of the coordinates of x to +1 or 
— 1. Then we take S to be the set of coordinates that are set to zero in the current x and use z to fix 
at least half of the remaining coordinates of x to +1 or —1. We repeat this until all coordinates of x 
are non-zero. Since at most 1 5*1/2 coordinates are set to zero in each round of fixing coordinates, we 
will repeat at most log n times. The total discrepancy is bounded by the sum of the discrepancies 
incurred in each round of fixing. The discrepancy incurred by fixing those coordinates of x which 
are non-zeros in z is bounded by 

for all vectors Ai, i ^ [m] and all subsets 5 C [n]. 

This general bound depends on the length of the vector Ai^ . It is straightforward to obtain 
ll^i^ll < 2(7 I S"! log mn whp in our setting by bounding the maximum coefficient. This gives a 
bound of 

I 2m 
8crJ|5|log (mn)log — 
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on the discrepancy of , i.e., A restricted to any subset S of columns. This bound on the discrep- 
ancy of A^ is good enough when the cardinahty of S is smaller than some threshold, but too large for 
large S. E.g., when S = [n], this gives a total discrepancy of at most Oia^Jn log {ran) log {2m/n)). 

Another possible approach is to bound the length of vector Ai^ when each entry in the vector is 
from A^(0, o"^) (without bounding the maximum coefficient): using LemmaSl for any fixed S C [n] 
and i G [m], 

Pr fl \\AiSf - \S\a'^\ > Acj2) < 2e~^. 



By union bound, we get that 

Pr (3S C [n],i G [m] : I \\Ai, 





a2 






< 2e 


24|S 


■ (i^i) 


• m 








< 2e" 


" 24|SI 







Thus, taking A = |5| v^48(log(n) + (1/|5|) log m) we get \\Ai'^\\ < 48cj|5| y^log n + (l/jS'l) log m for 
every i £ [m] and S C [n] whp. Therefore, the discrepancy incurred in each round of fixing is at 
most 

196(j\S\^ ^logn + i^logm^ ^og^ 
and the total bound on discrepancy of ^4^ is at most 



This bound is still large (e.g., this gives the total discrepancy to be at most ©(crny^lognlog {2m/ n)). 
In fact, when each entry is from A^(0, cr^), it is possible that there exists a subset of coordinates 
S C [n] such that the length of Oj'^ is i7(cT|S'|). 

However, in order to bound the total discrepancy, we only need to bound the length of the 
remaining vector after each round of fixing. Let S denote the set of coordinates to be fixed in 
the current round. The existence lemma (Lemma [6]) picks some subset from S of at least 1 5*1/2 
coordinates to fix so that the discrepancy is at most 4||AiS|| y^log(2m/|5|). Hence, it leaves at 
most \S\/2 coordinates among the possible |5| coordinates for the next round. It is sufficient to 
bound the probability that there exists a subset T C 5 of size at most \S\/2 such that the length 
of the vector Ai"^ is large. We do not need the length of Ai"^ to be small for every subset T C [n]. 
Thus, the union bound is only over the choices of the coordinates yet to be fixed (subsets of S 
of size at most |5'|/2) and not over all possible subsets of coordinates. We use this approach in 
Lemma [TOl to obtain a stronger bound on the length of the vectors Ai'^k for every i G [m] and every 
collection of subsets {81,82, ■ ■ ■ ,8^) where 8^ C 8k-i and \8k\ < n2~^. This helps us obtain the 
tighter bound for hereditary discrepancy as stated in Theorem [6l 

3.1 Bucket entropy for matrices 

We follow the outline of the entropy method by Spencer |20j to derive the following bound. 
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Lemma 6. For any set of vectors Ai, . . . ,Am S and any subset S CI [n], there exists a point 
z G {O.-l.+l}!-^! with at least \S\/2 non-zero coordinates such that 



\AiSz\<8\\AiS\\^log—yi€ [m]. 

Our proof of Lemma [6] is nearly identical to the known classical proof. However, since we use 
a different normalization (||a|| rather than ||a||oo), the statement we need is not a direct corollary 
and we have to formally run through the proof for completeness. 



3.1.1 Proof idea 

The proof is by the probabilistic method. We show that there exist two vectors x,y € {+1, —1}''^' 
such that 

1. lAj^x — Ai^y\ is small for every i G [m] 

2. X and y differ in a large number of coordinates. 

Thus, taking z = gives a vector z so that z £ {0, — 1, and z has a large number of 
non-zero coordinates. Further, since |^j'^(x — y)\ is small, \Ai^ Z\ is also small for every i £ [m]. 

In order to show that there exist vectors x,y £ {— satisfying condition 1 above, we 
consider the value for every x G { — 1,+1}I'^L We show that there exist x,y € {— Ij+l}!*^' so 

that the difference between |j4j^x| and is small for each i G [m]. For this, we consider a real 

line for each i G [m] and equi-partition the i'th line into small parts for each i G [m]. Then, we show 
that there exist an exponential number of vectors x G { — such that their corresponding 
values fall in the same part for every i G [m]. Thus, we get a set containing exponential 
number of vectors in {—1, +1}''^' so that for any pair of vectors x, y in this set, — is at 

most the length of each part corresponding to i G [m]. Therefore, we have an exponential number 
of vectors satisfying condition 1. 

Finally, since an exponential number of vectors x G { — 1, +1}''^' satisfy condition 1, there should 
exist at least two such vectors x and y with large hamming distance. Thus, among the set of vectors 
satisfying property 1, there should exist at least two vectors satisfying property 2. 

Notation. We define the following function for equi-partitioning. For any A > 0, define buckets 

B^:= [-A,A] 

for every positive integer /, := {{21 — 1)A, {21 + 1)A] 

B^_l := [-(2/ + l)A,-(2/-l)A). 

Suppose we have a real vector t = (ti,--- ,tm)- Then define the bucketing function = 
{PI^ (a;), • • • , {x)) where 

Phx)=j if Y.A,Xj£Bl 
Thus, the length of each part in the equipartition for the i'th vector is 2tj. 
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3.1.2 Proof of Lemma [6] 

The following lemma shows the existence of a point z with discrepancy of Ai due to z being at 
most ti for each i G [m] if the entropy of the bucketing function is small. 

Lemma 7. Suppose x G {— 1,+1}" is chosen uniformly at random. //ENT < l/SI/S, then 

there exists a point z G {0, — 1, with at least \S\/2 non-zero coordinates such that 

z\ < ti for each i G [m]. 

Proof of Lemma\2\ Let r = \S\. Since ENT (P*(x)) < |, there exists a vector b = {bi, . . . ,bm) 
such that Pr {P\x) = b) > 2 5 . Since total number of possible choices for x is 2^ , at least 2 s of 
the choices for x should map to b. This implies that there exist x, y which differ in at least r/2 
coordinates such that P^ix) = P*{y) [13j. Taking z = completes the proof of the lemma. □ 

We use the following lemma to bound the entropy. This is very similar to Lemma 2.3 in [16]. 

Lemma 8. Let S be an arbitrary subset of [n]. Let x G { — 1, +1}'"^' be chosen uniformly at random. 
Then ENT < G{ti/ ||A^||) for every i G [m], where 



G(A) 



40e-^ i/A>0.1, 
401n(l/A) if X <{).!. 



Proof of LemmalM Suppose we pick x uniformly at random in {+1, — l}''^!. Let 

Pk :=Pr(i^^"^^'l'(x) = A; 

3Ai|A,s|| 



Then, ENT Pf"'^'^ " (x) = Zk -Pk log(Pfc)- Also, 



E (xj) = for each j G [n], 
E (xj) = 1 for each j G [n], 
E (Ai^x) = for any Ai, 

i65 



By Lemma O 
Define 



Pr {Ai'^x > A||AiS||) < e"^. 

_ A^(2fc-l)^ 

gk-= e 8 , A; > 1 
50 := 1 - 2e 8 . 

By Lemma O pk,P-k < 5fc and po ^ 5o- The function — xlogx is increasing in (0, 1/e) and 
decreasing in [1/e, 1]. 
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When A > 10, gQ>l/e and gk < i/e for A; > 1. Therefore, 

oo 

ENT(i^^"^"^"(a;)) < -50 log ffo + 2 J] log fffc 



k=l 



< 26e~ — . 



When 0.1 < A < 10, by Jensen's inequality, Y.l=-ioo -Pk^ogpk < log\K\ < 8. For |A:| > 101, 
Qk < 1/e and hence 



fe=101 fc=101 

Thus, 

ENT ( P:^"^*^"(x)l < 9 < 266" 



When A < 0.1, by Jensen's inequality, Z]fe:|fc|<A-2o —pk^ogpk < log|i^|. For \k\ > A gt < 
1/e. Therefore, 

ENt(/^^I'^^'I'(x)) <log(l + 2A-20) + 2 -9kloggk 

A::|fc|>A-20 

< 401n(l/A). 

□ 

Proof of LemmalBi Let r = \S\. Suppose we pick x uniformly at random in {+1,— l}'"^'. We 



show that ENT {P\x)) < r/5 for U = 8 \\AiS\\ ^log i G [m]. The existence of a point z G 
{0, -1,+1}I'^I with at least \S\/2 non-zero coordinates such that 

< ti for each i £ [m] 

follows by Lemma [71 

By sub-additivity of entropy function, 

m 

ENT (P*(x)) < ^ENT {Pj'{x)) . 

i=l 



Due to the choice of ti, we have that t^/ \\Ai^\\ = Sy^log (2m/r) > 0.1. Therefore, by LemmaEl 

ENT(j^*'(x)) <40e-(64/9)i°g^. 

Thus, 



ENT {P\x)) < 40me~""*i— < 



□ 
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3.2 Bounding lengths of Gaussian subvectors 

Lemma 9. If each entry Aij is drawn i.i.d. from N{0,a'^), then with high probability 

max \\Ai^\\ < 2a^/\S\log mn 

■t6 [m] 

for every subset S Q [n]. 

Proof. If each entry Aij is drawn i.i.d. from A^(0, cr^), then by Lemma[2]the maximum entry \Aij\, 
i G [m],j G [n] is at most 2a\/Togrrm with high probabihty. □ 

Next we obtain a bound on the length of Ai^ when IS"! is large. 

Lemma 10. Suppose we have a matrix A G ]^"^x"- where n > logm and each entry Aij is drawn 
from N{0, a"^). For any collection of subsets So, ^i, 5*2, . . . , Siog {«/ log m) of the set [n], where Sq C 
[n], Sk ^ Sk-i, \Sk\ < n2~^ for = 0, 1, . . . , log (n/ log m), the following holds with high probability. 

WAi'^kf <lQn2'^a'^ (1) 

for every i G [m] and k = 0,1, . . . , log (n/logm). 

Proof of Lemma \JU Let S" be a collection of subsets So,Si, . . . , ^log („/ log m) of [n] such that 5*0 ^ 
[n],Sk ^ Sk~i and \Sk\ < n2^^ for k = 0, 1, . . . , log (n/ log m). We will show that ^ holds for 
every possible 5. 

We say that a subset of the collection S is heavy if there exists i G [m] such that it violates 
([I]). We denote a collection S of subsets to be heavy if there exists k G {0, 1, . . . ,log (n/logm)} 
such that Sk is heavy. 

Thus, a collection is heavy if one of its subsets is heavy. We will bound the probability that 
there exists a heavy collection. Therefore, 

log (n/ log m) 

Pr (35 : 5 is heavy) < ^ Pr (3Sk C Sk-i, \Sk\ < n2~'' : Sk is heavy) 

k=0 

We bound each term in the above sum as follows. For k = 0, 

Pr{So Q [n] is heavy) < Pr (bSq Q [n],i G [m] : Pi^of > IGncr^) 

< Pr (3i G [m] : \\Aif > IGna"^^ 

< 2e~^" • m (Using Lemma H]) 
<2e-4iogm (n>logm). 

For each k = 1,2, . . . , log(n/ logm), 

Pr (3Sk C Sk-i, \Sk\< n2~^ : is heavy) 
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< 2e • -m (Using Lemma HD 



<2e-2i°g™ (fc < log(n/logm)). 



Thus, 



Pr(35 



S is heavy) < — - ■ log 



log m 



n 



) 



^ 0. 



□ 



3.3 Proof of Theorem [2] 

We prove Theorem [6] showing a bound on the hereditary discrepancy. 

Proof of Theorem\^ We use Lemma [6] repeatedly to fix the coordinates of x. We start with Sq = S. 
By Lemma[6]there exists a point zq E {0, —1, containing at most |5'o|/2 zeros. Let Si denote 

the subset of coordinates of zq that are zero. Then we set z{j) = ZQ{j) for every j ^ Si. We take 
S = Si. By Lemma [6] there exists a point zi G {0, — 1, containing at most \S\/2 zeros. Let 
^2 denote the subset of coordinates of zi that are zero. Then we set x{j) = zi{j) for every j S'2. 
We repeat this until the number of coordinates of x that are yet to be set is at most a constant 
with high probability. We set these remaining coordinates to be —1/1 arbitrarily. The discrepancy 
incurred by x due to this arbitrary setting is at most a constant. 

We use Lemma [10] to bound the discrepancy incurred when the number of coordinates to be 
fixed is greater than logm and Lemma [9] to bound the discrepancy incurred when the number of 
coordinates to be fixed is at most log m. 

By Lemma [6l the discrepancy incurred by x while setting its coordinates using subset S^, 
A; G {0, 1, ... , log (n/ log m)} is at most 




with high probability. Here, the second inequality is by using Lemma [TOl 

Thus, the discrepancy incurred by x due to zq,zi, - ■ ■ , zi^g („/iogm) is at most 




with high probability. 
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For k > log (n/ log m), the number of coordinates 15*^1 < logm. By Lemma [6l the discrepancy 
incurred by x while setting its coordinates using subset Sk, k G {log (n/logm) + 1, . . . ,log IS"]} is 
at most 



< 16(7 -i/ n2 ^ log (mn) log ■ 



with high probability. Here, the second inequality is by using Lemma [9] and 15/cl < n2~^. 
Thus, the discrepancy incurred by x due to ziog{n/\ogm)+i^ • • • ) -^log \s\ is at most 

A:=log|5| fc=log|5| 



lAj^fcZfcl < 16a \ n2 log (mn) log 

A;=log fc=log -r-^ 

log m log m 



2m 
n2^ 



, 2m 
< 32(7 W log m log (mn) log ■ 



log m 

with high probability. Hence, the total discrepancy is bounded by 



meo<.\Ai^x\ < 64a \ Wnlog h i / log m log (mn) log ■ 



ie[n] \ V n V logm / 

with high probability. □ 

Finally, we bound the linear discrepancy of random matrix (Theorem [2]). Theorem [2] follows 
from Theorems [5] and [H We give a direct proof here for the sake of completeness. Our proof 
strategy is well-known (see Corollary 8 in [20j). 

Proof of TheoremlE We will find x by rounding xq. Without loss of generality, let xq be such that 
xo{j) G [0, 1] for each j G [n]. Let the vector xq be rational. Suppose each coordinate in xq can be 
expressed in binary using at most p bits. We will round in p phases - each phase will reduce the 
number of bits needed to express each coordinate in the rounded vector by one. 

Consider the binary expansion xo(j) = Ylk=o^j,k'^~^ ^ ^ {OjI}- Let S denote the set of 
coordinates of xq which require precision at the p-th bit, i.e., S = {j : 5j^p = 1}. Now, by Theorem 
[6l there exists a point z G { — 1, +1}''^' such that 



/ / 2m / 2m 

< 64cr W 15*1 log — + Wlogmlog(m|S'|)log- 



|5| V log / 

Now, consider the following rounding procedure to obtain xi: Set z{j) = for every j ^ S and 
xi = Xq + z2~P. It is clear that the number of bits needed to express xi is at most p — 1. This is 
because, exactly those coordinates which required precision at the p-th bit were rounded. Further, 
they were rounded in a manner so that the p-th bit is set to 0. This is because z{j) G {+1, —1} 
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for every j £ S (rounding could possibly change the p — 1-th bit in each coordinate) . We also have 
that 

\Ai{xi -xo)\ = \Aiz2-P\ 



<64a 1^^151 log ^ + yiogm log (m|5|) log • 2"^ 

We repeat this rounding procedure at most p — 1 times thereby reducing the number of bits of 
precision by at least one each time. Thus, the final x obtained needs one bit of precision for each 
coordinate and hence x G {0, 1}". Finally, 

p-i 

\Ai{x -xo)\ < ^ \Ai{xk - Xk-i)\ 

k=l 

V / I 



< 64^7^2 ^' |^y|5fc|log^ + yiogmlog(mn)log^^ 
p 



< 64(7 2 [ \ln log + \ / log m log imn) log ^ | {\Sk\ <n) 
^ I V n V logm / 



, , 2m / 2m 

< 64(7 \ \\n log V \ \ log m log (mn) log 

' " n V logm 



□ 



4 Proof of the main existence theorem 

The upper bound R\ for the radius in Theorem [T] will follow from the linear discrepancy bound 
given in Theorem [2j For the lower bound, we use the following result for Gaussian matrices. 

Lemma 11. For m > lOOOn, let A £ j^^x" q matrix whose entries are chosen i.i.d. from the 
normal distribution N{0,a'^). Let xq := (1/2, ... , 1/2) G R". Then, 



Pr X G : \A,{x - Xo)\ < fy^log^ G [m]j < ^. 

We first show a bound on the radius required so that the random IP P{n, m, 0, R) contains an 
integer point with all nonzero coordinates. Lemma [TT] follows from the choice of xq. 

Lemma 12. For m > lOOOn, let A G R'"^"- be a matrix whose entries are chosen i.i.d. from the 
normal distribution iV(0,(7^). With probability at least 1 — 2"", there does not exist x G Pi {x G 
M" : \xj\ > 1 V J G [n]} such that 

\Aix\ < a\/n log {2m /n) for every i = 1, . . . ,m. 
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Proof of Lemma[TB. For each r > 0, we define the set 



Ur := Z" n {x : \\x\\ = r, \xj\ > V j € [n]}. 

We will show that with probability at least 1 — 2~"', there does not exist x £ Ur>oUr satisfying all 
the m inequalities. We first observe that Ur is non-empty only if r > ^/n. Fix r > ^/n and a point 
X G Ur- Now, for i £ [m], since each Aij is chosen from N{0,a'^), the dot product 



AiX — ^ ^ A 



ijXj 



is distributed according to the normal distribution N{0,r a ). Let 



2/71 

Px := Pr I \Aix\ < (7^/nlog V i € [m] 



By union bound, 



:= Pr [ 3x £ Ur ■■ \Aix\ < cr^/nlog — V i G [m] 



-fr ^ / -fx < I t^r I max Px ■ 



We will obtain an upper bound on Px that depends only on r. To bound the size of the set Ur, we 
observe that every point in Ur is an integer point on the surface of a sphere of radius r centered 
around the origin and hence is contained in an euclidean ball of radius r + 1 centered around the 
origin. Thus, \Ur \ can be bounded by the volume of the sphere of radius r + 1 < 2r centered around 
the origin: 

|[/r| < vol(2rBo) < ( 2rA/— ) < ( ^ 



n 



n 



Next we bound Pr- We have two cases. 
Case 1. Let r G 



n, Y^n log (2m/n) . Since AiX is distributed according to A^(0, r^cx^), by Lemma 



Pr \Aix\ < CTA/nlog < 1 



2m\ 1 / rJnlog^ 



n j"^ ^\r'^+n\og^] \2m) 
Since each Aij is chosen independently, we have that 



2=1 



<|1 ' 



v2tv \ r-^+nlog 

< e 



^2^ 1 r2 + nlog^ / \2mJ 
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Therefore, by union bound, it follows that 



1 ( ^V"'°g¥ 



Pr < e 



(")^.m+nlog^ 



Case 2. Let r > Y^^^ogT^wyn). Since is distributed according to A'^(0, r^cj^), by LemmalU we 

have that 

2m\ 1 [2 \ 2m 4 r~ 2m 



Pr \Aix\ < a\ nlog < -W — nlog < — Wnlog 

\ V n I r \ TT norV n 

The random variables Aix, . . . ,AmX are independent and identically distributed. Therefore, 



-Px = n I l^^^l log 



i=l 



n 

m 



Hence, by union bound, 



4: I 2m 

- T\h^°s — 

\ or V n 



-n I 2i log I =i£ -log 12l 



< e 



< 



Vnlog^ 
5r 



Finally, 



Pr I 3x G Ur>^C/r : \Aix\ < a\Jnlog— Vi G [m] | = ^ P, 

E = E + E 

''^^ refVS^nlog^HlJ r>^nlog2^ 



10 .1 r=^/n \ ^ J V^/ J r=\l nlog 



1 /4\^ /2A/nlog 



2m 
n 



lO'^n-l V5/ I m-2 



1 

< — (since m > lOOOn). 
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□ 

Proof of Lemma [771 There exists x G Z" such that 



\Ai{x - Xo)\ < ^Wnlog— yi € [m] 
2 V n 

if and only if there exists x G n {x G M" : Xj > 1 V j G [n]} such that 



lAjxl < a\lnlog Vi G [ml. 

V n 

The result follows by Lemma [T2l □ 

We are now ready to put everything together and prove Theorem [H 
Proof of Theorem [7J Let 

P = {x G M" : ajX < 6i V i G [m]} 

where each Oj is chosen from a spherically symmetric distribution. Then Oj = Oj/ ||aj|| for i G [m] is 
distributed randomly on the unit sphere. A random unit vector ai can be obtained by drawing each 
coordinate from the normal distribution A^(0, = 1 /n) and normalizing the resulting vector. Thus, 
we may assume ai = Ai/ \\Ai\\ where each coordinate Aij is drawn from the normal distribution 
A^(0, 1/n). Here, we show that the probability that there exists a vector Ai that gets scaled by 
more than a constant is at most 2me~"/^^. 

Taking r = n and cr^ = 1/n in Lemma [U we have 

Pr (^i G [m] : I \\Ai\\^ - 1| > < 2me~^. 

Hence, with probability at least l — 2me~^/'^^, we have that a/1/2 < \\Ai\\ < ^/3/2 for every i G [m]. 
1. Since P contains a ball of radius Ri, P ^ Q where 

Q = {x G M"i \ai{x - xo)| < Ri for i G [m]} 
Using Theorem [2] and o"'^ = 1/n, we know that there exists x G such that for every i G [m] 



, , , ., , I 2m / log m log (mn) , 2m 

|Ai(x-Xo)| <64 Wlog— + ^ ^log: 



n y n logm 

Thus, with probability at least 1 — 2me~"/^^, there exists X G satisfying 

\Ai{x - xo)| 



\ai{x - xo)| 



I A; 



< 128 I Wlog ^ + A"g^"^°g("^") log . 



n y n logm 
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for every i G [m]. Thus the polytope Q is integer feasible and consequently P is also integer feasible. 
2. For xo = (1/2,... ,1/2), let 



P = |x G : \Mx -Xo)\< WAW ^1 log ^ Vi G [m]j . 

Then, P contains a ball of radius Rq centered around xq and hence is an instance of the random 
polytope P(n, m, Xq, -Rq)- Further, with probability at least 1 — 2me~"/^®, P is contained in 



Q= jxGM": |Ai(x-Xo)| < ^^log^Vi G h|. 

By Lemma [TT| with high probability, we have that Q H = 0. Thus, with probability at least 
1 - 2me-"/96, we have that P n Z" = 0. 

□ 



5 Algorithm to find an integer point 

Next we present an algorithm to identify an integer point x in P{n,m,XQ, Ralg)- We first show 
an algorithm to find small linear discrepancy solutions for gaussian constraint matrices. 

Theorem 7. There is a randomized polynomial-time algorithm that takes as input a random matrix 
A G M™^" with i.i.d. entries from A^(0,(T'^) and a point xq G M", and outputs an integer point x 
such that for every i G [m] , 

\Ai{x — xo)\ < 2^^ a ( \/nlog h y^log m log (mn) log ^ 



n log m ^ 

with high probability. 

Our algorithm is similar to Bansal's algorithm [l]. The algorithm runs in phases. In each phase, 
we start with the current point x G [0, 1]" and perform a random walk to arrive at a partial vector 
y with at least half of the non-integer coordinates of x being integers in y. Further, the discrepancy 
overhead incurred by y (i.e., \Ai{y — x)\) is small. 



Algorithm Round-IP 

Input: Point xqGM", matrix yl G M™'<" where each Ay ~ iV(0, cj^). 
Output : An integer point x in the polytope 

P = I X : \ Aiix — xq)\ < 2^^ a ( ^/nlog h ^/log m log (mn) log ■ 



2m 



log m 

1. Initialize x = Xo. 

2. While (x is not integral) 

(a) Define S to be the set of non-integer coordinates of x 

(b) x^ Partial-Vector(A,x,S'). 

(c) If X = NULL, then abort. 

3. Output X. 
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We describe the partial vector function used in the above algorithm in the next section. Its 
functionality is summarized in the following lemma. 

Lemma 13. Given a point xq S [0, 1]", let S C [n] denote the subset of non-integer coordinates of 
xq. There exists a polynomial time algorithm that produces a vector y £ [0,1]" with at most \S\/2 
non-integer coordinates, such that 

\My-XQ)\ < 1281p.s||iog_ 

for every i G [m] with probability at least 1/2. 

In Algorithm Round-IP, we repeatedly invoke the Partial- Vector algorithm. Each such call fixes 
at least half the non- integer coordinates of x to integers. Thus, with at most logn calls to the 
partial-vector algorithm, we obtain an integer vector x. Further, the total discrepancy overhead 
incurred by x is at most the sum of the discrepancy overhead incurred in each call to the Partial- 
Vector algorithm. The sum of the discrepancy overheads is bounded similar to the proof of Theorem 
m using Lemmas [9] and [lOl 



5.1 Partial integer vector for Gaussian matrices 

We show a polynomial time algorithm that given a point xq as input, finds a point y such that the 
number of non-integer coordinates is halved and the discrepancy overhead incurred by y is small. 
This algorithm is along the lines of Bansal's algorithm Bansal's algorithm outputs a partial 
vector with small discrepancy overhead when the matrix A is a 0/1 matrix. Here, we need an 
algorithm to find a partial vector with small discrepancy overhead when each entry in A is drawn 
i.i.d. from N{0,a^). 

The vector y is constructed iteratively by performing a random walk starting at xq. In the 
t'th iteration, xt = xt~i + 7* for an appropriate choice of 7^. Let S denote the set of non-integer 
coordinates of xq. The increments 7t(i) satisfy the following properties. 

1. The increment 7t(j) is zero if j ^ S and are distributed as an unbiased gaussian with standard 
deviation at most 1 if j G S. Thus, each non-integer coordinate evolves as a martingale. 

2. The increments are such that they add up to at least s(|5'|/2) for some scaling parameter s. 
This ensures that after about 1/s^ steps, about half the coordinates are integers (or close to 
integers) . 

3. Finally, at any step t, the increments (7tO))jeS are correlated such that "^j^g Aij'jtij) is 
distributed as an unbiased Gaussian with small standard deviation (proportional to the length 
of Ai^). This ensures that the discrepancy overhead incurred is also evolving as a martingale 
with small increments. 



5.1.1 Algorithm 

We describe the Partial- Vector algorithm with the following parameters: r = \S\, u = logm, 
s = l/(4u^/^), q = log {2m /r) and for all i S [m], 

. „ , /,N 100g|UiS|i2 



A(0) := 0, A(A:) := 640g p/|| ( 2 - -J VA: = 1, 2, . . . , and aiik) := ^V' VA: = 0, 1,2, 
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Algorithm Partial- Vector ( A, xq, 5") 

Input: Vector xq with index set S of non-integer coordinates and a matrix A G 
Output: Vector x with at most 151/2 non-integer coordinates such that 

\Ai{x -xo)\< 1281 \\Ai^\\ log 1^ for every i e [m]. 

Initialize C(0) ^ S. 
Repeat for t = 1, . . . , 16/s^ : 

1. Let 

^r{A) ■= A{xt - Xt-i), G [m], r = l,---,i-l 

t-i 

Vi ■= \ ^1t{A)\ G [m],. 

r=l 

Declare a vector Ai to be /c-dangerous if r]i G [/3i(/c), + 1)] . Let S{k) := {i : 
Ai is A;-dangerous} . If r/j > 2/3(1) for any zG [m] , then abort. 

2. Find a feasible solution to the following semidefinite program: 



je[n] 

n 

II^Ai^ilP <ai(fc) G 5(A;),A; = 0,1,2,--- 

||^'jf<l VjGC(t-l) 
||^;j.f = o Vj^C(t-l) 

If the SDP is infeasible, then abort. 

3. Obtain each coordinate gk according to the standard normal distribution 
7V(0,1). 

7t(j) ■= s < g,Vj >, 

Xt ^ Xt-l+Jf 

If Xt{j) > 1 or Xt{j) < for any j, then abort. 

4. Let B{t) := {j : Xt{j) > 1 - ^ or xtij) < i}. For every j G B{t) , 



1 with probability Xt{j), 
with probability l — xt{j). 



5. Update C{t) -(^ C{t - 1)\ B{t) . If |C(t)| < |5|/2, then terminate and output xt- 
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5.1.2 Discrepancy incurred due to randomized rounding 



We first show that the discrepancy overhead incurred due to randomized rounding in step 4 is 
small. Let x' denote the vector obtained if the rounding in Step 4 is not performed. That is, x' is 
the vector obtained at the end of the algorithm and whose respective coordinates were fixed once 
they exceeded 1 — (1/u) or become smaller than Xju. 

Claim 14. With high probability, 

\Ai{x-x')\ <i\\AiS\\ Vi G [m]. 



Proof of Claim [7^ First observe that \Ai{x — x')\ < \ X^jg^ ^ij(a^O) — x'{j))\. Consider a coordi 



nate j ^ S which was rounded in step 4. Then, 



E {x{j) - x'{j)) = 
Var (x(i) - x'(j)) < 



u 

Since the variables in S are the only variables that can get rounded. 



Var :=A2< 



Therefore, for i £ [m], by Chernoff bound, 



Pr I \ Y,Aj{x{j) - x'{j))\ > 4Aiyi^ I = ^. 

jes ^ 



Hence, by union bound, we get that \Ai{x — x')\ < 4AjT/log m < 4||Aj^|| for every i G [m] with 
high probability. □ 



5.1.3 Feasibility of semidefinite program 

Next we show that the SDP during iteration t is feasible with high probability. 

Lemma 15. Suppose the number of k- dangerous vectors is at most ^2"^^^''^^^ for every k = 1,2,... 
during iteration t. Then the SDP in iteration t is feasible with high probability. 

Proof of Lemma\15[ It is enough to prove that there exists a z G {0, +1, — l}''^^*""'^^! with at least 
\C{t — 1)1/2 non-zero coordinates such that < ai{k) exists for every /c-dangerous vector 

Ai. Such a point z gives a feasible solution to the SDP. We show the existence of such a point z 
similar to the proof of Lemma [6l 

By Lemma [71 it is sufficient to show that ENT < r/5, where Aj < a/ ai{k) if i G S{k). 

We take Aj = 10 \\AiC(t-i) \\ ^/{l/{k + 1)5) log (2m/r) for every i G S{k). By sub-additivity of 
entropy function 

m 

ENT (P^(x)) < J^ENT (Pt'ix)) = ENT (p^'ix)) . 

i=l k=0,l,...i&S{k) 
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Let i G S{k). Let ({k) := S^Jq/{k + 1)5. Since, C(0) = 10^ > 0.1, by LemmaEl 

ENT (Pi\x)^ < 40me~^^'^ < ^. 

jeS(o) 

Next, suppose i G S{k) such that A; > 1. The function G in Lemma [8] is a decreasing function 
Thus, if C{k) > 0.1, then we can use G{C{k)) < 401n(10) as an upper bound for ENT (^P^^'{x) 
Therefore, 

ENT (^P,^'{x)^ < 40max(ln(10),ln(l/C(/c))) < 40max(ln(10), ln((A; + 1)^/^)) < 2001n(A: + 1). 
Thus, 

Y ENT (-F^^HX)) < ^r2~i°('=+^) • 2001n(A: + 1) < ^. 



Hence, 



fc>ljeS(fc) A;>1 

ENT (P^(x)) < 



r 
5 

□ 



Next we bound the probabihty that a large number of vectors are /c-dangerous. For this, 
observe that the increment 7t(j) = if i C(t — 1) and 7t(j) is distributed according to the 
normal distribution iV(0, s^) if i G C{t — 1). Similarly, jti^i) is distributed according to the normal 
distribution N{0,a''^) where cj'^ < ai{k)s'^ if i G S{k). This is due to the SDP constraint. 

Lemma 16. For k = 1,2,-- - , let Dj^ denote the event that more than rrik = r2^'^^^^^^^ vectors 
ever become k-dangerous during t = 1,2, . . . , 16/s^. Then, 

Proof of Lemma 1 1 61 First consider k = 1. Suppose a vector Ai becomes 1-dangerous at some 
iteration t. Then, there exists t when r/j first exceeds /3(1). Until t, rji was evolving as a martingale 
with each conditional increment drawn from Gaussian distribution with mean and variance at 
most a(0)s^. Hence, by LemmaEJ 

Pr ( 7/i > at some t = t < j < 2e "^^Im^^W^ = 2 l^—j . 

Hence, the expected number of vectors that become 1-dangerous is at most m(r/2m)^^ < r2~'^^. 
By Markov's inequality 

^0-30 

Pr(|S(l)|>mi)<^<2-io. 

Now suppose k > 2. If Ai becomes fe-dangerous during iteration t, then it was (A; — l)-dangerous 
at some iteration t < t and rji increased by /3i{k) — (3i{k — 1) in at most 16/s^ iterations. Since 
Var {-ftiAi)) < ai{k — l)s^ when rji G [f3i{k — l),/3i{k)], we get that 

/ 16 \ (0(k)-l3(k-l)f ^ 

Pr Ai becomes fc-dangerous in at most iterations < 2e 4a(fe-i)s^(i6/3^) < g , 

\ ) 2m 

Thus, expected number of A;-dangerous vectors is at most re~^'^^/2 < '[•2^^^^^^^\ Hence, by 
Markov's inequality 

Pr {\S{k)\ > nik) < 2-5(fc+i). 

□ 
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5.1.4 Proof of Lemma 

We first show that the probabihty that the number of non- integer variables is at least r/2 after 
iterations is at most 1/4. Then, we bound the probability that none of the events Dj^ happen. 
Conditioned on the event that none of the events D/. happen, we get that the SDP is feasible for 
every iteration t and the discrepancy incurred is bounded by at most 2/3(1). Since the discrepancy 
overhead incurred by randomized rounding is small, we get Lemma [131 

The following Lemma is identical to that of Lemma 4.1 in [Ij. We give a proof here for the sake 
of completeness. 

Claim 17. Let E he the event that the number of non-integer variables is at least r/2 after 16/s^ 
iterations. Then, 

Proof of Claim [77[ Define for every t = 0, 1, 2, . . . , 16/s^, 



Wt 



wt-i + ^ if|C(t)|<§. 



Now, if \C{t)\ < §, then wt - wt-i = (sV/4). If \C{t)\ > r/2, then 

(n n 
Y,{{xt-i{j)+it{j)f -Y,^t-i{jf 

n 

= Y^Mltij?) (Since E,(7t(i)) = 0) 

jec(t-i) 
^ s^r 

Thus, '&{wt — wt^i) > s'^r/A for every t = 1, 2, . . . , 16/s^. Further, if \C{t)\ > r/2, then vut < r, 

and hence, wt < r + (l/4)ts^r for every t = 1, 2, . . . , 16/s2. Therefore, for to = le/s^, we have that 



tos'^r 



<E{wt,-wo))<Eiujt,) 



<Pr(|C(to)|>-j-r + Pr^|C(to)|<2 

tos^r 



'f\ ( tos^r 



Pr{E)r + {l - Pr{E)) ( r + 
r+'-^]-PriE) 



Thus, Pr (E) < A/tos^ = 1/4. □ 
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Proof of LemmalISi Since jtU) distributed as Gaussian with variance at most s^, by Lemma [21 
the algorithm does not abort in step (3) with high probability. 

Let D = U^^^Dk and E be the event as defined in Claim [T71 Now, 

Pr{D)< Prpfc)<l. 

fc=l,2,3,--- 

Thus, with probability at least 15/16, D holds. If D holds, then the semidefinite program is feasible 
for every t = 0, 1, ... , 16/s^, and hence the algorithm does not abort. Further, if D holds, then 
mk < 1 for k = 21ogm. Hence, there are no (2 log m)-dangerous vectors. Therefore, the discrepancy 
incurred for Ai is at most /3j(21ogm) + 4||Aj'5||. Therefore, the total discrepancy incurred is at 
most 

\Ai{xto -xq)\ < ft (2 log m) + 4 II 
<2ft(l)+4piS|| 

< 1281g pi^ll . 

Now, it is sufficient to show that Pr{DnE) > 1/2. By Claim [HI we have that Pr {E) < 1/4. 
Therefore, 



Pr (D n ^) > 1 - Pr (D) - Pr (E) > 



□ 



5.2 Finding an integer point 

Proof of Theorem^ Without loss of generality, we may assume that xq € [0, 1]" and our objective 
is to find x € {0, l}" with low discrepancy overhead. We use Algorithm Round-IP. We will show 
that it succeeds with probability at least 1/n to find a point x G {0, 1}" such that 

(2m 2m \ 
^/n log h a/ log m log mn log I . 

n log m J 

The success probability of this algorithm can be amplified by repeating it n times. 

Let X denote the vector output the Algorithm Round-IP and let x^ denote the vector x in 
Algorithm Round-IP after k calls to the Partial- Vector algorithm. Let denote the set of non- 
integer coordinates in x^. By Lemma [T3l the discrepancy overhead incurred in the /c'th run of the 
random walk based Partial- Vector algorithm for k G {0, 1, . . . ,log (n/logm)} is 

\Ai^^{xk-Xk-i)\ < 1281 Pi^HI log 



< 2^^o-Vn2-^ log 



2m 



with high probability. Here, the second inequality is by using Lemma fTOl and 15^1 < n2~ 
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Thus, the discrepancy overhead incurred after log (n/ log m) calls to the Partial- Vector algorithm 

is 

log log 

log m log m 2 

\Ai^^{xk-Xk-i)\< Yl 2'V\/^log^ 

k=0 k=0 

< 2^VVnlog 

n 

with high probability. 

For k > log (n/ log m), the number of non-integer coordinates \Sk\ < logm. By Lemma \T3\ 
the discrepancy overhead incurred in the fc'th call to the Partial- Vector algorithm, where k E 
{log (n/ log m) + 1, . . . , log n} is 



\Ai^^{xk-Xk-i)\ < 1281 PiS. II log 



\Sk\ 

2m 



< 2 (j\ n2 ^ log (mn) log 



n2' 



with high probability. Here, the second inequality is by using Lemma and \Sk\ <n2 ^. 
Thus, the discrepancy overhead incurred by Algorithm Round-IP 

fc=logn fc=logn 

' 2m 



Y \Ai^''{xk-Xk^i)\< Y 2i3a-yn2-Mog(mn)log 



n2~'^ 

fc=log y^ fc=log y^ 

log m ^ log m 

< 2^'^(T\/log m log (mn) log — 

logm 

with high probability. 

Since each call to the Partial- Vector algorithm sets at least half of the remaining non-integer 
coordinates to integers, we call the Partial- Vector algorithm at most logn times. Each call succeeds 
with probability 1/2. Hence, with probability at least 1/2'°^"" = 1/n, we obtain an integer point 
X e {0, 1}*^ such that the total discrepancy overhead is bounded as follows: 



(2fn 
\/n log h Y^logmlog (mn) log 
n 



2m 
logm 



□ 



Proof of Theorem O We use Theorem [7] to derive Theorem [3l 
Let 

/ 2m r 2m \ 

Ralg = 2 (7 V"- log h V log ^ log (mn) log 

\ n log m / 

and Qj = Aj/ ||ylj||, i G [m]. Solve the following linear programming problem to find the center of 
the largest ball contained in the polytope. 

maxR 
R < bi — AiX, for i G [m]. 



25 



Let {xo,R) be a solution to the above LP. Since P contains a ball of radius RalGi there exists 
A > Ralg ll^ill for every i G [m] such that 

P D {x G M"| - xo)| < (3i for i G [m]}. 

Observe that the polytope 

Q = {x£ W\ \ai{x - xq)\ < Ralg for i G HI 

is contained in P. We will show that there exists a randomized polynomial-time algorithm to 
find an integer point in Q that succeeds with probability at least (1 — 2me~^/^^) /2. This success 
probability is over the choice of AiS. 

Since each Ai is drawn from a spherically symmetric distribution, Oj = Aj/ ||^j|| is distributed 
uniformly on the unit sphere. A random unit vector Ui on a sphere is obtained by drawing each 
coordinate aij i.i.d. from the normal distribution A^(0, 1/n) and scaling the resulting vector by 



Yl^=i ^Ij- Similar to the proof of Theorem [H Oj gets scaled by at most 2 for every i G [ 



with probability at least 1 — 2me~"/^^. Using Theorem [7| and o"^ = 1/n, we know that there exists 
a randomized polynomial time algorithm that succeeds with high probability to find x G such 
that for every i G [m] 



I ( M^oi5(i 2m /logmlog(mn) 2m 
\ai{x - xo)| < 2'=' log h \ log ■ 



n V n logm / 

Thus, the same randomized polynomial-time algorithm finds a point x G satisfying 



\ai{x - xq)\ I 2m /logmlog(mn) 2m 
\ai{x - xo)| = — n — n — ^ ^ log ^ V log ■ 



n V n logm / 

for every i G [m]. The success probability of the algorithm reduces by a factor of 1 — 2me~"/^^ due 
to the randomness in the input. □ 
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